public transportation
Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study
Jonnala, Ramya, Liang, Gongbo, Yang, Jeong, Alsmadi, Izzat
The integration of large language models (LLMs) into public transit systems presents a transformative opportunity to enhance urban mobility. This study explores the potential of LLMs to revolutionize public transportation management within the context of San Antonio's transit system. Leveraging the capabilities of LLMs in natural language processing and data analysis, we investigate their capabilities to optimize route planning, reduce wait times, and provide personalized travel assistance. By utilizing the General Transit Feed Specification (GTFS) and other relevant data, this research aims to demonstrate how LLMs can potentially improve resource allocation, elevate passenger satisfaction, and inform data-driven decision-making in transit operations. A comparative analysis of different ChatGPT models was conducted to assess their ability to understand transportation information, retrieve relevant data, and provide comprehensive responses. Findings from this study suggest that while LLMs hold immense promise for public transit, careful engineering and fine-tuning are essential to realizing their full potential. San Antonio serves as a case study to inform the development of LLM-powered transit systems in other urban environments.
Multi-Granularity Semantic Revision for Large Language Model Distillation
Liu, Xiaoyu, Zhang, Yun, Li, Wei, Li, Simiao, Huang, Xudong, Chen, Hanting, Tang, Yehui, Hu, Jie, Xiong, Zhiwei, Wang, Yunhe
Knowledge distillation plays a key role in compressing the Large Language Models (LLMs), which boosts a small-size student model under large teacher models' guidance. However, existing LLM distillation methods overly rely on student-generated outputs, which may introduce generation errors and misguide the distillation process. Moreover, the distillation loss functions introduced in previous art struggle to align the most informative part due to the complex distribution of LLMs' outputs. To address these problems, we propose a multi-granularity semantic revision method for LLM distillation. At the sequence level, we propose a sequence correction and re-generation (SCRG) strategy. SCRG first calculates the semantic cognitive difference between the teacher and student to detect the error token, then corrects it with the teacher-generated one, and re-generates the sequence to reduce generation errors and enhance generation diversity. At the token level, we design a distribution adaptive clipping Kullback-Leibler (DAC-KL) loss as the distillation objective function. DAC-KL loss exploits a learnable sub-network to adaptively extract semantically dense areas from the teacher's output, avoiding the interference of redundant information in the distillation process. Finally, at the span level, we leverage the span priors of a sequence to compute the probability correlations within spans, and constrain the teacher and student's probability correlations to be consistent, further enhancing the transfer of semantic information. Extensive experiments across different model families with parameters ranging from 0.1B to 13B demonstrate the superiority of our method compared to existing methods.
- North America > United States (0.04)
- Asia > China > Hong Kong (0.04)
- Transportation > Passenger (1.00)
- Transportation > Ground > Road (1.00)
- Education (1.00)
The OPNV Data Collection: A Dataset for Infrastructure-Supported Perception Research with Focus on Public Transportation
Vosshans, Marcel, Baumann, Alexander, Drueppel, Matthias, Ait-Aider, Omar, Woerner, Ralf, Mezouar, Youcef, Dang, Thao, Enzweiler, Markus
This paper we present our vision and ongoing work for a novel dataset designed to advance research into the interoperability of intelligent vehicles and infrastructure, specifically aimed at enhancing cooperative perception and interaction in the realm of public transportation. Unlike conventional datasets centered on ego-vehicle data, this approach encompasses both a stationary sensor tower and a moving vehicle, each equipped with cameras, LiDARs, and GNSS, while the vehicle additionally includes an inertial navigation system. Our setup features comprehensive calibration and time synchronization, ensuring seamless and accurate sensor data fusion crucial for studying complex, dynamic scenes. Emphasizing public transportation, the dataset targets to include scenes like bus station maneuvers and driving on dedicated bus lanes, reflecting the specifics of small public buses. We introduce the open-source ".4mse" file format for the new dataset, accompanied by a research kit. This kit provides tools such as ego-motion compensation or LiDAR-to-camera projection enabling advanced research on intelligent vehicle-infrastructure integration. Our approach does not include annotations; however, we plan to implement automatically generated labels sourced from state-of-the-art public repositories. Several aspects are still up for discussion, and timely feedback from the community would be greatly appreciated. A sneak preview on one data frame will be available at a Google Colab Notebook. Moreover, we will use the related GitHub Repository to collect remarks and suggestions.
- Europe > Germany > Baden-Württemberg > Stuttgart Region > Stuttgart (0.04)
- Europe > Netherlands > South Holland > Delft (0.04)
- Oceania > Australia (0.04)
- (11 more...)
- Transportation > Infrastructure & Services (1.00)
- Transportation > Ground > Road (1.00)
Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Rosset, Corby, Chung, Ho-Lam, Qin, Guanghui, Chau, Ethan C., Feng, Zhuo, Awadallah, Ahmed, Neville, Jennifer, Rao, Nikhil
Existing question answering (QA) datasets are no longer challenging to most powerful Large Language Models (LLMs). Traditional QA benchmarks like TriviaQA, NaturalQuestions, ELI5 and HotpotQA mainly study ``known unknowns'' with clear indications of both what information is missing, and how to find it to answer the question. Hence, good performance on these benchmarks provides a false sense of security. A yet unmet need of the NLP community is a bank of non-factoid, multi-perspective questions involving a great deal of unclear information needs, i.e. ``unknown uknowns''. We claim we can find such questions in search engine logs, which is surprising because most question-intent queries are indeed factoid. We present Researchy Questions, a dataset of search engine queries tediously filtered to be non-factoid, ``decompositional'' and multi-perspective. We show that users spend a lot of ``effort'' on these questions in terms of signals like clicks and session length, and that they are also challenging for GPT-4. We also show that ``slow thinking'' answering techniques, like decomposition into sub-questions shows benefit over answering directly. We release $\sim$ 100k Researchy Questions, along with the Clueweb22 URLs that were clicked.
- North America > United States > New York > New York County > New York City (0.04)
- Asia > Middle East > Jordan (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- (11 more...)
- Transportation > Infrastructure & Services (1.00)
- Leisure & Entertainment (1.00)
- Law (1.00)
- (6 more...)
As corporate America pivots to AI, consumers rejected for loans, jobs
Most days she is able to live comfortably without a car. She works remotely often but occasionally she needs to go into the office. That's where her situation gets a bit challenging. Her workspace is not easily accessible by public transportation. Because she doesn't need to drive often she applied for the car-sharing platform Zipcar to fulfill her occasional need.
- North America > United States > New York > Kings County > New York City (0.05)
- North America > United States > Connecticut (0.05)
- Banking & Finance > Credit (0.50)
- Transportation > Ground > Road (0.49)
Climate Change from Large Language Models
Climate change presents significant challenges to the global community, and it is imperative to raise widespread awareness of the climate crisis and educate users about low-carbon living. Artificial intelligence, particularly large language models (LLMs), have emerged as powerful tools in mitigating the climate crisis, leveraging their extensive knowledge, broad user base, and natural language interaction capabilities. However, despite the growing body of research on climate change, there is a lack of comprehensive assessments of climate crisis knowledge within LLMs. This paper aims to resolve this gap by proposing an automatic evaluation framework. We employ a hybrid approach to data acquisition that combines data synthesis and manual collection to compile a diverse set of questions related to the climate crisis. These questions cover various aspects of climate change, including its causes, impacts, mitigation strategies, and adaptation measures. We then evaluate the model knowledge through prompt engineering based on the collected questions and generated answers. We propose a set of comprehensive metrics to evaluate the climate crisis knowledge, incorporating indicators from 10 different perspectives. Experimental results show that our method is effective in evaluating the knowledge of LLMs regarding the climate crisis. We evaluate several state-of-the-art LLMs and find that their knowledge falls short in terms of timeliness.
- North America > Canada > Ontario > Toronto (0.04)
- Europe > Sweden > Halland County > Halmstad (0.04)
- Health & Medicine (0.93)
- Energy > Renewable (0.68)
- Transportation > Ground > Road (0.68)
Prompt, Condition, and Generate: Classification of Unsupported Claims with In-Context Learning
Christensen, Peter Ebert, Yadav, Srishti, Belongie, Serge
Unsupported and unfalsifiable claims we encounter in our daily lives can influence our view of the world. Characterizing, summarizing, and -- more generally -- making sense of such claims, however, can be challenging. In this work, we focus on fine-grained debate topics and formulate a new task of distilling, from such claims, a countable set of narratives. We present a crowdsourced dataset of 12 controversial topics, comprising more than 120k arguments, claims, and comments from heterogeneous sources, each annotated with a narrative label. We further investigate how large language models (LLMs) can be used to synthesise claims using In-Context Learning. We find that generated claims with supported evidence can be used to improve the performance of narrative classification models and, additionally, that the same model can infer the stance and aspect using a few training examples. Such a model can be useful in applications which rely on narratives , e.g. fact-checking.
- Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
- Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
- North America > United States > Virginia > Arlington County > Arlington (0.04)
- (12 more...)
- Law (1.00)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
- Health & Medicine > Therapeutic Area > Immunology (1.00)
- (9 more...)
The Optimized path for the public transportation of Incheon in South Korea
faradunbeh, Soroor Malekmohammadi, Li, Hongle, Kang, Mangkyu, Iim, Choongjae
Path-finding is one of the most popular subjects in the field of computer science. Pathfinding strategies determine a path from a given coordinate to another. The focus of this paper is on finding the optimal path for the bus transportation system based on passenger demand. This study is based on bus stations in Incheon, South Korea, and we show that our modified A* algorithm performs better than other basic pathfinding algorithms such as the Genetic and Dijkstra. Our proposed approach can find the shortest path in real-time even for large amounts of data(points).
- Asia > South Korea > Incheon > Incheon (0.61)
- Asia > South Korea > Daegu > Daegu (0.05)
- North America > United States (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- Research Report (0.50)
- Workflow (0.46)
- Transportation > Passenger (1.00)
- Transportation > Infrastructure & Services (1.00)
- Transportation > Ground > Road (1.00)
Israel launches $17m self-driving public bus project - Al-Monitor: Independent, trusted coverage of the Middle East
Four consortia of international and Israeli companies have been chosen to operate a two-year pilot program to test autonomous public transportation in Israel. The Nov. 7 announcement by the Transportation Ministry and the Israel Innovation Authority follows a call for proposals issued in September 2021. In additional to Israel, the consortia includes companies from France, the U.S., Turkey and Noway. The first phase of the NIS61 million ($17.75 million) pilot will consist of experiments at test and operational sites, while the second will be conducted under a temporary license along public transportation lanes. The pilot follows Knesset legislation approved in March 2022 to develop a knowledge base regarding the safety of independent vehicles.
- Asia > Middle East > Israel (1.00)
- Europe > Middle East (0.40)
- Africa > Middle East (0.40)
- (2 more...)
Driverless car projects: our pick of 10 favorites - DesignWanted : DesignWanted
There was a time when no one could imagine a driverless car would ever exist. But gradually, what we once thought was impossible has become a reality. The first autonomous cars are now commercially available! Although Leonardo da Vinci developed the self-propelled carriage in the 15th century, it was in the 20th century that the concept was realized. When Google announced in 2009 that it would start researching unmanned cars, the idea became even more attractive. Currently, several well-known companies are looking into developing semi-autonomous and fully driverless cars, which could result in significantly fewer traffic accidents.
- Asia > China > Hong Kong (0.06)
- North America > United States > California > San Francisco County > San Francisco (0.05)
- Transportation > Passenger (1.00)
- Transportation > Ground > Road (1.00)
- Information Technology > Robotics & Automation (1.00)
- Automobiles & Trucks (1.00)